Skip to content

feat(benchmark): print scenario runtime, status and prompt for viewing extended#152

Merged
ross-rl merged 4 commits into
mainfrom
ross/spin2
Mar 6, 2026
Merged

feat(benchmark): print scenario runtime, status and prompt for viewing extended#152
ross-rl merged 4 commits into
mainfrom
ross/spin2

Conversation

@ross-rl

@ross-rl ross-rl commented Mar 6, 2026

Copy link
Copy Markdown
Contributor

Description

Command Rename: status → summary

  • Renamed benchmark-job status to benchmark-job summary
  • File renamed: status.ts → summary.ts

New -e, --extended Flag for Summary

  • By default, rli benchmark-job summary shows only the aggregate table (agent scores)
  • With -e flag, shows individual scenario results underneath each agent

Watch Command Improvements

Elapsed time per scenario:

  • Each running scenario now shows how long it's been running (e.g., running (2m 30s))

Completion behavior:

  • On job completion, exits full-screen and shows:
    • "Benchmark job completed!" message
    • Total elapsed time
    • Summary table (without individual scenarios)
    • Hint: To see full results, run: rli benchmark-job summary -e <job_id>

Display improvements:

  • Added job ID line under the job name header
  • Added terminal resize handler to prevent display artifacts
  • Content now truncates to fit terminal height (header stays visible, progress truncated from bottom)

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update
  • Code refactoring
  • Performance improvement
  • Test updates

Related Issues

Closes #

Changes Made

Testing

  • I have tested locally
  • I have added/updated tests
  • All existing tests pass

Checklist

  • My code follows the code style of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have updated the documentation accordingly
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • Any dependent changes have been merged and published

Screenshots (if applicable)

Additional Notes

@james-rl james-rl left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm. Some suggestions for minor improvements but nothing major

Comment thread src/commands/benchmark-job/summary.ts
Comment thread src/commands/benchmark-job/watch.ts
@ross-rl ross-rl merged commit 32f4f72 into main Mar 6, 2026
14 checks passed
@ross-rl ross-rl deleted the ross/spin2 branch March 6, 2026 20:01
ross-rl pushed a commit that referenced this pull request Mar 6, 2026
🤖 I have created a release *beep* *boop*
---


##
[1.13.0](v1.12.0...v1.13.0)
(2026-03-06)


### Features

* add benchmark list features
([#154](#154))
([2abe85c](2abe85c))
* **benchmark:** add status breakout during status -w
([#149](#149))
([393548e](393548e))
* **benchmark:** add watch with term takeover / interactive display
([#151](#151))
([c49ee7a](c49ee7a))
* **benchmark:** print scenario runtime, status and prompt for viewing
extended ([#152](#152))
([32f4f72](32f4f72))
* **benchmark:** support multi agent jobs
([#145](#145))
([41ec8d3](41ec8d3))


### Bug Fixes

* remove spurious warning for optional secrets
([#146](#146))
([3abea8a](3abea8a))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants